Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 9146 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 93 |
| Duplicate rows (%) | 1.0% |
| Total size in memory | 714.7 KiB |
| Average record size in memory | 80.0 B |
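The duplicate-row figures above can be reproduced with a few lines of arithmetic. The sketch below assumes a duplicate is any row that repeats an earlier row (the default behaviour of pandas' `DataFrame.duplicated`); the helper name `duplicate_summary` and the toy rows are illustrative and not part of the report.

```python
from collections import Counter


def duplicate_summary(rows):
    """Return (number of rows repeating an earlier row, their share in percent)."""
    counts = Counter(tuple(row) for row in rows)
    # Every occurrence after the first one counts as a duplicate.
    n_duplicates = sum(count - 1 for count in counts.values())
    pct = 100.0 * n_duplicates / len(rows) if rows else 0.0
    return n_duplicates, pct


# Toy data (hypothetical): the row (1.0, 2.0) appears three times.
rows = [(1.0, 2.0), (1.0, 2.0), (3.0, 4.0), (1.0, 2.0), (5.0, 6.0)]
n_dup, pct = duplicate_summary(rows)
print(n_dup, f"{pct:.1f}%")

# The report's own figures: 93 duplicate rows out of 9146 observations.
report_pct = 100.0 * 93 / 9146
print(f"{report_pct:.1f}%")
```

With the report's counts, 93 / 9146 rounds to the 1.0% shown in the table.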
Variable types
| NUM | 10 |
|---|
Reproduction
| Analysis started | 2021-04-26 13:39:10.210271 |
|---|---|
| Analysis finished | 2021-04-26 13:40:04.264410 |
| Duration | 54.05 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
| Dataset has 93 (1.0%) duplicate rows | Duplicates |
|---|---|
| size_npear is highly correlated with mass_npea and 2 other fields | High correlation |
| mass_npea is highly correlated with size_npear and 4 other fields | High correlation |
| damage_size is highly correlated with mass_npea and 2 other fields | High correlation |
| exposed_area is highly correlated with mass_npea and 4 other fields | High correlation |
| std_dev_malign is highly correlated with mass_npea and 3 other fields | High correlation |
| damage_ratio is highly correlated with mass_npea and 1 other field | High correlation |
mass_npea
Real number (ℝ≥0)
| Distinct count | 8847 |
|---|---|
| Unique (%) | 96.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9903.052173627817 |
|---|---|
| Minimum | 2864.76 |
| Maximum | 36995.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 71.6 KiB |
Quantile statistics
| Minimum | 2864.76 |
|---|---|
| 5-th percentile | 4887.2075 |
| Q1 | 6988.42 |
| median | 8895.965 |
| Q3 | 12119.95 |
| 95-th percentile | 17593.025 |
| Maximum | 36995.4 |
| Range | 34130.64 |
| Interquartile range (IQR) | 5131.53 |
Descriptive statistics
| Standard deviation | 4060.577116 |
|---|---|
| Coefficient of variation (CV) | 0.4100328913 |
| Kurtosis | 1.343576214 |
| Mean | 9903.052174 |
| Median Absolute Deviation (MAD) | 2361.61 |
| Skewness | 1.09779161 |
| Sum | 90573315.18 |
| Variance | 16488286.51 |
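Several of the derived statistics in these tables follow directly from the others: the range is maximum minus minimum, the IQR is Q3 minus Q1, and the coefficient of variation is the standard deviation divided by the mean (4060.577116 / 9903.052174 ≈ 0.41003 for this variable). The following sketch computes them with the Python standard library. It assumes quantiles use linear interpolation, the standard deviation is the sample (n − 1) version, and MAD is the median of absolute deviations from the median; the function names and sample data are illustrative, not taken from the report.

```python
import math
import statistics


def quantile(sorted_values, q):
    """Quantile by linear interpolation between the two nearest order statistics."""
    position = q * (len(sorted_values) - 1)
    lower = math.floor(position)
    upper = math.ceil(position)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


def describe(values):
    """Compute a few of the summary statistics shown in the tables above."""
    data = sorted(values)
    mean = statistics.mean(data)
    std = statistics.stdev(data)  # sample standard deviation (divides by n - 1)
    median = quantile(data, 0.5)
    q1 = quantile(data, 0.25)
    q3 = quantile(data, 0.75)
    mad = statistics.median([abs(x - median) for x in data])
    return {
        "range": data[-1] - data[0],
        "iqr": q3 - q1,
        "cv": std / mean,
        "mad": mad,
    }


# Hypothetical sample, chosen so the results are easy to check by hand.
stats = describe([1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0])
print(stats)
```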
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 4000.26 | 5 | 0.1% | |
| 12958.4 | 4 | < 0.1% | |
| 13475.4 | 4 | < 0.1% | |
| 12311.8 | 3 | < 0.1% | |
| 11887.8 | 3 | < 0.1% | |
| 7034.32 | 3 | < 0.1% | |
| 7379.32 | 3 | < 0.1% | |
| 13004.3 | 3 | < 0.1% | |
| 8465.78 | 3 | < 0.1% | |
| 5811.82 | 3 | < 0.1% | |
| 10605.3 | 3 | < 0.1% | |
| 6411.31 | 3 | < 0.1% | |
| 9929.73 | 3 | < 0.1% | |
| 5189.07 | 2 | < 0.1% | |
| 10016.1 | 2 | < 0.1% | |
| 10526.1 | 2 | < 0.1% | |
| 10146.4 | 2 | < 0.1% | |
| 6894.15 | 2 | < 0.1% | |
| 19280.7 | 2 | < 0.1% | |
| 10314.2 | 2 | < 0.1% | |
| 16876.8 | 2 | < 0.1% | |
| 8123.23 | 2 | < 0.1% | |
| 11954.7 | 2 | < 0.1% | |
| 7725.48 | 2 | < 0.1% | |
| 8791.92 | 2 | < 0.1% | |
| Other values (8822) | 9079 | 99.3% |
| Value | Count | Frequency (%) | |
| 2864.76 | 1 | < 0.1% | |
| 3114.98 | 2 | < 0.1% | |
| 3124.09 | 1 | < 0.1% | |
| 3281.67 | 1 | < 0.1% | |
| 3328.18 | 1 | < 0.1% | |
| 3335.6 | 1 | < 0.1% | |
| 3338.36 | 1 | < 0.1% | |
| 3342.18 | 1 | < 0.1% | |
| 3344.43 | 1 | < 0.1% | |
| 3348.79 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 36995.4 | 1 | < 0.1% | |
| 30970.4 | 1 | < 0.1% | |
| 29601.9 | 1 | < 0.1% | |
| 29078.6 | 1 | < 0.1% | |
| 28960.1 | 1 | < 0.1% | |
| 28404.9 | 1 | < 0.1% | |
| 28001.7 | 1 | < 0.1% | |
| 27827.1 | 1 | < 0.1% | |
| 27546.4 | 1 | < 0.1% | |
| 27446.6 | 1 | < 0.1% |
size_npear
Real number (ℝ≥0)
| Distinct count | 8859 |
|---|---|
| Unique (%) | 96.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3032.8278373059266 |
|---|---|
| Minimum | 510.53 |
| Maximum | 13535.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 71.6 KiB |
Quantile statistics
| Minimum | 510.53 |
|---|---|
| 5-th percentile | 1229.72 |
| Q1 | 1983.6575 |
| median | 2684.33 |
| Q3 | 3830.745 |
| 95-th percentile | 5816.2175 |
| Maximum | 13535 |
| Range | 13024.47 |
| Interquartile range (IQR) | 1847.0875 |
Descriptive statistics
| Standard deviation | 1462.334147 |
|---|---|
| Coefficient of variation (CV) | 0.4821685321 |
| Kurtosis | 1.887115716 |
| Mean | 3032.827837 |
| Median Absolute Deviation (MAD) | 842.23 |
| Skewness | 1.169529805 |
| Sum | 27738243.4 |
| Variance | 2138421.156 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 1087.13 | 5 | 0.1% | |
| 3493.28 | 4 | < 0.1% | |
| 2025.16 | 3 | < 0.1% | |
| 2470.77 | 3 | < 0.1% | |
| 2004.67 | 3 | < 0.1% | |
| 1053.23 | 3 | < 0.1% | |
| 1541.52 | 3 | < 0.1% | |
| 2061.5 | 3 | < 0.1% | |
| 6172.93 | 3 | < 0.1% | |
| 2533.6 | 3 | < 0.1% | |
| 2034.78 | 3 | < 0.1% | |
| 4331 | 3 | < 0.1% | |
| 3102.87 | 3 | < 0.1% | |
| 2238.05 | 3 | < 0.1% | |
| 4814.93 | 3 | < 0.1% | |
| 4099.99 | 3 | < 0.1% | |
| 2329.12 | 2 | < 0.1% | |
| 2592.09 | 2 | < 0.1% | |
| 5414.72 | 2 | < 0.1% | |
| 1614.48 | 2 | < 0.1% | |
| 2292.14 | 2 | < 0.1% | |
| 3787.25 | 2 | < 0.1% | |
| 2422.5 | 2 | < 0.1% | |
| 3028.79 | 2 | < 0.1% | |
| 1799.5 | 2 | < 0.1% | |
| Other values (8834) | 9077 | 99.2% |
| Value | Count | Frequency (%) | |
| 510.53 | 1 | < 0.1% | |
| 520.33 | 1 | < 0.1% | |
| 572.7 | 1 | < 0.1% | |
| 577.8 | 1 | < 0.1% | |
| 601.21 | 1 | < 0.1% | |
| 623.42 | 1 | < 0.1% | |
| 631.17 | 1 | < 0.1% | |
| 673.61 | 1 | < 0.1% | |
| 678.82 | 1 | < 0.1% | |
| 681.59 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 13535 | 1 | < 0.1% | |
| 12491.1 | 1 | < 0.1% | |
| 10935 | 1 | < 0.1% | |
| 10688.2 | 1 | < 0.1% | |
| 10658.9 | 1 | < 0.1% | |
| 10441.2 | 1 | < 0.1% | |
| 10282.1 | 1 | < 0.1% | |
| 10263.9 | 1 | < 0.1% | |
| 9964.75 | 1 | < 0.1% | |
| 9885.56 | 1 | < 0.1% |
malign_ratio
Real number (ℝ≥0)
| Distinct count | 7386 |
|---|---|
| Unique (%) | 80.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.30308287557402147 |
|---|---|
| Minimum | 0.11482 |
| Maximum | 0.5253 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 71.6 KiB |
Quantile statistics
| Minimum | 0.11482 |
|---|---|
| 5-th percentile | 0.2050725 |
| Q1 | 0.2590525 |
| median | 0.301055 |
| Q3 | 0.3430025 |
| 95-th percentile | 0.4117525 |
| Maximum | 0.5253 |
| Range | 0.41048 |
| Interquartile range (IQR) | 0.08395 |
Descriptive statistics
| Standard deviation | 0.06253294702 |
|---|---|
| Coefficient of variation (CV) | 0.2063229303 |
| Kurtosis | 0.01478203704 |
| Mean | 0.3030828756 |
| Median Absolute Deviation (MAD) | 0.04198 |
| Skewness | 0.2322996952 |
| Sum | 2771.99598 |
| Variance | 0.003910369464 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0.26957 | 7 | 0.1% | |
| 0.26628 | 6 | 0.1% | |
| 0.28408 | 5 | 0.1% | |
| 0.32156 | 5 | 0.1% | |
| 0.27382 | 5 | 0.1% | |
| 0.25653 | 5 | 0.1% | |
| 0.24372 | 5 | 0.1% | |
| 0.25271 | 5 | 0.1% | |
| 0.27176 | 5 | 0.1% | |
| 0.3217 | 5 | 0.1% | |
| 0.27037 | 4 | < 0.1% | |
| 0.25551 | 4 | < 0.1% | |
| 0.33982 | 4 | < 0.1% | |
| 0.26721 | 4 | < 0.1% | |
| 0.32367 | 4 | < 0.1% | |
| 0.31073 | 4 | < 0.1% | |
| 0.31537 | 4 | < 0.1% | |
| 0.32613 | 4 | < 0.1% | |
| 0.31598 | 4 | < 0.1% | |
| 0.34859 | 4 | < 0.1% | |
| 0.32154 | 4 | < 0.1% | |
| 0.26591 | 4 | < 0.1% | |
| 0.30964 | 4 | < 0.1% | |
| 0.259 | 4 | < 0.1% | |
| 0.32155 | 4 | < 0.1% | |
| Other values (7361) | 9033 | 98.8% |
| Value | Count | Frequency (%) | |
| 0.11482 | 1 | < 0.1% | |
| 0.12161 | 1 | < 0.1% | |
| 0.12441 | 1 | < 0.1% | |
| 0.126 | 1 | < 0.1% | |
| 0.12815 | 1 | < 0.1% | |
| 0.12861 | 1 | < 0.1% | |
| 0.12975 | 1 | < 0.1% | |
| 0.13024 | 1 | < 0.1% | |
| 0.13089 | 1 | < 0.1% | |
| 0.13512 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0.5253 | 1 | < 0.1% | |
| 0.52428 | 1 | < 0.1% | |
| 0.52374 | 1 | < 0.1% | |
| 0.51992 | 1 | < 0.1% | |
| 0.51873 | 1 | < 0.1% | |
| 0.51821 | 1 | < 0.1% | |
| 0.51569 | 1 | < 0.1% | |
| 0.51396 | 1 | < 0.1% | |
| 0.51379 | 1 | < 0.1% | |
| 0.51123 | 1 | < 0.1% |
damage_size
Real number (ℝ≥0)
| Distinct count | 8861 |
|---|---|
| Unique (%) | 96.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 103.90211786573366 |
|---|---|
| Minimum | 10.3101 |
| Maximum | 346.42 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 71.6 KiB |
Quantile statistics
| Minimum | 10.3101 |
|---|---|
| 5-th percentile | 40.462475 |
| Q1 | 64.012525 |
| median | 88.4583 |
| Q3 | 134.209 |
| 95-th percentile | 207.808 |
| Maximum | 346.42 |
| Range | 336.1099 |
| Interquartile range (IQR) | 70.196475 |
Descriptive statistics
| Standard deviation | 55.45686176 |
|---|---|
| Coefficient of variation (CV) | 0.5337413991 |
| Kurtosis | 1.383014273 |
| Mean | 103.9021179 |
| Median Absolute Deviation (MAD) | 29.2055 |
| Skewness | 1.223522874 |
| Sum | 950288.77 |
| Variance | 3075.463516 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 33.732 | 5 | 0.1% | |
| 162.594 | 4 | < 0.1% | |
| 93.9039 | 3 | < 0.1% | |
| 184.361 | 3 | < 0.1% | |
| 84.1988 | 3 | < 0.1% | |
| 51.6741 | 3 | < 0.1% | |
| 150.493 | 3 | < 0.1% | |
| 49.8648 | 3 | < 0.1% | |
| 168.55 | 3 | < 0.1% | |
| 44.7148 | 3 | < 0.1% | |
| 90.6229 | 3 | < 0.1% | |
| 76.4392 | 3 | < 0.1% | |
| 207.505 | 3 | < 0.1% | |
| 73.6682 | 3 | < 0.1% | |
| 52.5591 | 3 | < 0.1% | |
| 67.4358 | 2 | < 0.1% | |
| 60.2829 | 2 | < 0.1% | |
| 160.318 | 2 | < 0.1% | |
| 177.66 | 2 | < 0.1% | |
| 70.3361 | 2 | < 0.1% | |
| 67.5195 | 2 | < 0.1% | |
| 236.084 | 2 | < 0.1% | |
| 154.49 | 2 | < 0.1% | |
| 179.652 | 2 | < 0.1% | |
| 115.411 | 2 | < 0.1% | |
| Other values (8836) | 9078 | 99.3% |
| Value | Count | Frequency (%) | |
| 10.3101 | 1 | < 0.1% | |
| 10.6891 | 1 | < 0.1% | |
| 11.5576 | 1 | < 0.1% | |
| 16.7026 | 1 | < 0.1% | |
| 19.2305 | 1 | < 0.1% | |
| 20.8594 | 1 | < 0.1% | |
| 21.3794 | 1 | < 0.1% | |
| 22.0497 | 1 | < 0.1% | |
| 22.2613 | 1 | < 0.1% | |
| 23.112 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 346.42 | 1 | < 0.1% | |
| 344.346 | 1 | < 0.1% | |
| 344.093 | 1 | < 0.1% | |
| 342.271 | 2 | < 0.1% | |
| 340.422 | 1 | < 0.1% | |
| 340.062 | 1 | < 0.1% | |
| 337.931 | 1 | < 0.1% | |
| 337.505 | 1 | < 0.1% | |
| 336.478 | 1 | < 0.1% | |
| 335.532 | 1 | < 0.1% |
exposed_area
Real number (ℝ≥0)
| Distinct count | 8949 |
|---|---|
| Unique (%) | 97.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1372441.9647981194 |
|---|---|
| Minimum | 387853.4025 |
| Maximum | 4978616.0418 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 71.6 KiB |
Quantile statistics
| Minimum | 387853.4025 |
|---|---|
| 5-th percentile | 673233.7165 |
| Q1 | 959687.2643 |
| median | 1237057.06 |
| Q3 | 1693083.245 |
| 95-th percentile | 2448493.426 |
| Maximum | 4978616.042 |
| Range | 4590762.639 |
| Interquartile range (IQR) | 733395.9805 |
Descriptive statistics
| Standard deviation | 564677.287 |
|---|---|
| Coefficient of variation (CV) | 0.4114398288 |
| Kurtosis | 1.208698938 |
| Mean | 1372441.965 |
| Median Absolute Deviation (MAD) | 336057.5122 |
| Skewness | 1.064545725 |
| Sum | 1.255235421e+10 |
| Variance | 3.188604385e+11 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 569494.3892 | 5 | 0.1% | |
| 1798705.203 | 4 | < 0.1% | |
| 799977.1539 | 3 | < 0.1% | |
| 1186263.571 | 3 | < 0.1% | |
| 963134.9793 | 3 | < 0.1% | |
| 831886.2766 | 3 | < 0.1% | |
| 1877843.547 | 3 | < 0.1% | |
| 1502130.372 | 3 | < 0.1% | |
| 1106670.117 | 2 | < 0.1% | |
| 820445.0505 | 2 | < 0.1% | |
| 976360.2624 | 2 | < 0.1% | |
| 2215735.422 | 2 | < 0.1% | |
| 885777.1915 | 2 | < 0.1% | |
| 1438462.489 | 2 | < 0.1% | |
| 1851013.608 | 2 | < 0.1% | |
| 2931521.359 | 2 | < 0.1% | |
| 959554.8586 | 2 | < 0.1% | |
| 1702364.263 | 2 | < 0.1% | |
| 2346247.828 | 2 | < 0.1% | |
| 1217529.716 | 2 | < 0.1% | |
| 732592.6312 | 2 | < 0.1% | |
| 1178899.861 | 2 | < 0.1% | |
| 1297715.057 | 2 | < 0.1% | |
| 2064925.227 | 2 | < 0.1% | |
| 1615553.814 | 2 | < 0.1% | |
| Other values (8924) | 9085 | 99.3% |
| Value | Count | Frequency (%) | |
| 387853.4025 | 1 | < 0.1% | |
| 423115.4973 | 2 | < 0.1% | |
| 427736.5329 | 1 | < 0.1% | |
| 452984.9483 | 1 | < 0.1% | |
| 453815.2734 | 1 | < 0.1% | |
| 454396.2795 | 1 | < 0.1% | |
| 456432.7619 | 1 | < 0.1% | |
| 457690.3496 | 1 | < 0.1% | |
| 457924.3026 | 1 | < 0.1% | |
| 458243.5296 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4978616.042 | 1 | < 0.1% | |
| 4256876.116 | 1 | < 0.1% | |
| 4172679.498 | 1 | < 0.1% | |
| 3981595.216 | 1 | < 0.1% | |
| 3864242.825 | 1 | < 0.1% | |
| 3853609.234 | 1 | < 0.1% | |
| 3833944.925 | 1 | < 0.1% | |
| 3807277.967 | 1 | < 0.1% | |
| 3784682.868 | 1 | < 0.1% | |
| 3759338.507 | 1 | < 0.1% |
std_dev_malign
Real number (ℝ≥0)
| Distinct count | 8802 |
|---|---|
| Unique (%) | 96.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 146.30423893505358 |
|---|---|
| Minimum | 31.9704 |
| Maximum | 528.89 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 71.6 KiB |
Quantile statistics
| Minimum | 31.9704 |
|---|---|
| 5-th percentile | 62.781925 |
| Q1 | 95.8539 |
| median | 126.1385 |
| Q3 | 182.2515 |
| 95-th percentile | 278.92425 |
| Maximum | 528.89 |
| Range | 496.9196 |
| Interquartile range (IQR) | 86.3976 |
Descriptive statistics
| Standard deviation | 70.51217743 |
|---|---|
| Coefficient of variation (CV) | 0.4819558062 |
| Kurtosis | 1.23506292 |
| Mean | 146.3042389 |
| Median Absolute Deviation (MAD) | 38.4627 |
| Skewness | 1.151081569 |
| Sum | 1338098.569 |
| Variance | 4971.967166 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 46.039 | 5 | 0.1% | |
| 211.935 | 4 | < 0.1% | |
| 114.353 | 3 | < 0.1% | |
| 90.1626 | 3 | < 0.1% | |
| 137.35 | 3 | < 0.1% | |
| 178.987 | 3 | < 0.1% | |
| 112.873 | 3 | < 0.1% | |
| 198.959 | 3 | < 0.1% | |
| 111.301 | 3 | < 0.1% | |
| 140.507 | 3 | < 0.1% | |
| 65.2332 | 3 | < 0.1% | |
| 227.605 | 3 | < 0.1% | |
| 148.631 | 3 | < 0.1% | |
| 138.837 | 3 | < 0.1% | |
| 115.196 | 2 | < 0.1% | |
| 206.916 | 2 | < 0.1% | |
| 131.078 | 2 | < 0.1% | |
| 256.887 | 2 | < 0.1% | |
| 160.421 | 2 | < 0.1% | |
| 108.768 | 2 | < 0.1% | |
| 113.117 | 2 | < 0.1% | |
| 132.87 | 2 | < 0.1% | |
| 138.156 | 2 | < 0.1% | |
| 203.172 | 2 | < 0.1% | |
| 83.7198 | 2 | < 0.1% | |
| Other values (8777) | 9079 | 99.3% |
| Value | Count | Frequency (%) | |
| 31.9704 | 1 | < 0.1% | |
| 33.5225 | 1 | < 0.1% | |
| 34.1339 | 1 | < 0.1% | |
| 34.5528 | 2 | < 0.1% | |
| 35.4824 | 1 | < 0.1% | |
| 36.0849 | 1 | < 0.1% | |
| 37.2144 | 1 | < 0.1% | |
| 38.5446 | 1 | < 0.1% | |
| 39.0518 | 1 | < 0.1% | |
| 39.2869 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 528.89 | 1 | < 0.1% | |
| 470.477 | 1 | < 0.1% | |
| 461.06 | 1 | < 0.1% | |
| 452.554 | 1 | < 0.1% | |
| 449.5 | 1 | < 0.1% | |
| 444.056 | 1 | < 0.1% | |
| 430.45 | 1 | < 0.1% | |
| 425.373 | 1 | < 0.1% | |
| 424.572 | 1 | < 0.1% | |
| 424.309 | 1 | < 0.1% |
err_malign
Real number (ℝ≥0)
| Distinct count | 8831 |
|---|---|
| Unique (%) | 96.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3992.9362562869014 |
|---|---|
| Minimum | 1089.19 |
| Maximum | 91983.7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 71.6 KiB |
Quantile statistics
| Minimum | 1089.19 |
|---|---|
| 5-th percentile | 2187.3375 |
| Q1 | 3177.6825 |
| median | 3846.32 |
| Q3 | 4664.5775 |
| 95-th percentile | 5777.33 |
| Maximum | 91983.7 |
| Range | 90894.51 |
| Interquartile range (IQR) | 1486.895 |
Descriptive statistics
| Standard deviation | 1780.672859 |
|---|---|
| Coefficient of variation (CV) | 0.4459557441 |
| Kurtosis | 758.4896593 |
| Mean | 3992.936256 |
| Median Absolute Deviation (MAD) | 739.25 |
| Skewness | 18.40752405 |
| Sum | 36519395 |
| Variance | 3170795.832 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 1773.46 | 5 | 0.1% | |
| 4827.72 | 4 | < 0.1% | |
| 4628.02 | 4 | < 0.1% | |
| 4414.34 | 3 | < 0.1% | |
| 5709.89 | 3 | < 0.1% | |
| 4644.75 | 3 | < 0.1% | |
| 3069.56 | 3 | < 0.1% | |
| 4602.96 | 3 | < 0.1% | |
| 3034.98 | 3 | < 0.1% | |
| 3495.27 | 3 | < 0.1% | |
| 3011.5 | 3 | < 0.1% | |
| 4057.08 | 3 | < 0.1% | |
| 4037.82 | 3 | < 0.1% | |
| 3543.18 | 2 | < 0.1% | |
| 4211.51 | 2 | < 0.1% | |
| 5227.71 | 2 | < 0.1% | |
| 4865.39 | 2 | < 0.1% | |
| 4161.14 | 2 | < 0.1% | |
| 3837.05 | 2 | < 0.1% | |
| 2856.23 | 2 | < 0.1% | |
| 3243.29 | 2 | < 0.1% | |
| 3458.4 | 2 | < 0.1% | |
| 6220.49 | 2 | < 0.1% | |
| 3870.47 | 2 | < 0.1% | |
| 3286.63 | 2 | < 0.1% | |
| Other values (8806) | 9079 | 99.3% |
| Value | Count | Frequency (%) | |
| 1089.19 | 1 | < 0.1% | |
| 1108.9 | 1 | < 0.1% | |
| 1137.38 | 1 | < 0.1% | |
| 1157 | 1 | < 0.1% | |
| 1176.53 | 1 | < 0.1% | |
| 1196.94 | 1 | < 0.1% | |
| 1215.42 | 1 | < 0.1% | |
| 1241.68 | 1 | < 0.1% | |
| 1245.95 | 1 | < 0.1% | |
| 1254.14 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 91983.7 | 1 | < 0.1% | |
| 53031.3 | 1 | < 0.1% | |
| 46427.71 | 1 | < 0.1% | |
| 22963.56 | 1 | < 0.1% | |
| 22838.07 | 1 | < 0.1% | |
| 19881.74 | 1 | < 0.1% | |
| 19424.18 | 1 | < 0.1% | |
| 18453.85 | 1 | < 0.1% | |
| 18034.9 | 1 | < 0.1% | |
| 18030.86 | 1 | < 0.1% |
malign_penalty
Real number (ℝ≥0)
| Distinct count | 315 |
|---|---|
| Unique (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 69.84966105401269 |
|---|---|
| Minimum | 0 |
| Maximum | 340 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Memory size | 71.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 31 |
| median | 54 |
| Q3 | 91 |
| 95-th percentile | 183.75 |
| Maximum | 340 |
| Range | 340 |
| Interquartile range (IQR) | 60 |
Descriptive statistics
| Standard deviation | 55.7853324 |
|---|---|
| Coefficient of variation (CV) | 0.798648577 |
| Kurtosis | 3.194113489 |
| Mean | 69.84966105 |
| Median Absolute Deviation (MAD) | 27 |
| Skewness | 1.654867254 |
| Sum | 638845 |
| Variance | 3112.003312 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 32 | 140 | 1.5% | |
| 41 | 129 | 1.4% | |
| 40 | 122 | 1.3% | |
| 33 | 121 | 1.3% | |
| 43 | 120 | 1.3% | |
| 17 | 112 | 1.2% | |
| 29 | 111 | 1.2% | |
| 34 | 110 | 1.2% | |
| 36 | 108 | 1.2% | |
| 27 | 106 | 1.2% | |
| 35 | 106 | 1.2% | |
| 39 | 104 | 1.1% | |
| 30 | 104 | 1.1% | |
| 58 | 103 | 1.1% | |
| 44 | 103 | 1.1% | |
| 22 | 103 | 1.1% | |
| 12 | 102 | 1.1% | |
| 54 | 100 | 1.1% | |
| 26 | 100 | 1.1% | |
| 31 | 99 | 1.1% | |
| 45 | 99 | 1.1% | |
| 23 | 99 | 1.1% | |
| 37 | 98 | 1.1% | |
| 21 | 95 | 1.0% | |
| 18 | 94 | 1.0% | |
| Other values (290) | 6458 | 70.6% |
| Value | Count | Frequency (%) | |
| 0 | 2 | < 0.1% | |
| 1 | 28 | 0.3% | |
| 2 | 25 | 0.3% | |
| 3 | 10 | 0.1% | |
| 4 | 39 | 0.4% | |
| 5 | 52 | 0.6% | |
| 6 | 16 | 0.2% | |
| 7 | 41 | 0.4% | |
| 8 | 21 | 0.2% | |
| 9 | 43 | 0.5% |
| Value | Count | Frequency (%) | |
| 340 | 1 | < 0.1% | |
| 337 | 1 | < 0.1% | |
| 336 | 1 | < 0.1% | |
| 334 | 1 | < 0.1% | |
| 333 | 2 | < 0.1% | |
| 332 | 2 | < 0.1% | |
| 329 | 3 | < 0.1% | |
| 328 | 1 | < 0.1% | |
| 327 | 2 | < 0.1% | |
| 326 | 2 | < 0.1% |
damage_ratio
Real number (ℝ≥0)
| Distinct count | 8641 |
|---|---|
| Unique (%) | 94.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.461651924338504 |
|---|---|
| Minimum | 15.228 |
| Maximum | 46.5464 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 71.6 KiB |
Quantile statistics
| Minimum | 15.228 |
|---|---|
| 5-th percentile | 23.147375 |
| Q1 | 30.290225 |
| median | 35.24575 |
| Q3 | 38.806075 |
| 95-th percentile | 43.4149 |
| Maximum | 46.5464 |
| Range | 31.3184 |
| Interquartile range (IQR) | 8.51585 |
Descriptive statistics
| Standard deviation | 5.972808003 |
|---|---|
| Coefficient of variation (CV) | 0.1733175187 |
| Kurtosis | -0.2699688638 |
| Mean | 34.46165192 |
| Median Absolute Deviation (MAD) | 3.9416 |
| Skewness | -0.4626063578 |
| Sum | 315186.2685 |
| Variance | 35.67443544 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 46.5464 | 93 | 1.0% | |
| 44.4892 | 5 | 0.1% | |
| 23.015 | 4 | < 0.1% | |
| 39.7659 | 4 | < 0.1% | |
| 28.4308 | 4 | < 0.1% | |
| 36.8564 | 3 | < 0.1% | |
| 33.1548 | 3 | < 0.1% | |
| 27.968 | 3 | < 0.1% | |
| 36.7848 | 3 | < 0.1% | |
| 34.8833 | 3 | < 0.1% | |
| 35.6615 | 3 | < 0.1% | |
| 37.8556 | 3 | < 0.1% | |
| 31.7747 | 3 | < 0.1% | |
| 19.8525 | 3 | < 0.1% | |
| 29.8554 | 3 | < 0.1% | |
| 32.8819 | 3 | < 0.1% | |
| 38.4272 | 3 | < 0.1% | |
| 39.2259 | 3 | < 0.1% | |
| 36.3038 | 3 | < 0.1% | |
| 37.2619 | 3 | < 0.1% | |
| 39.1113 | 3 | < 0.1% | |
| 29.7563 | 3 | < 0.1% | |
| 38.9741 | 3 | < 0.1% | |
| 37.6793 | 3 | < 0.1% | |
| 28.7144 | 3 | < 0.1% | |
| Other values (8616) | 8976 | 98.1% |
| Value | Count | Frequency (%) | |
| 15.228 | 1 | < 0.1% | |
| 15.6443 | 1 | < 0.1% | |
| 15.8688 | 1 | < 0.1% | |
| 16.4815 | 1 | < 0.1% | |
| 16.6089 | 1 | < 0.1% | |
| 16.6222 | 1 | < 0.1% | |
| 16.7684 | 1 | < 0.1% | |
| 16.8136 | 1 | < 0.1% | |
| 16.8555 | 1 | < 0.1% | |
| 16.8967 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 46.5464 | 93 | 1.0% | |
| 46.2986 | 1 | < 0.1% | |
| 45.9443 | 1 | < 0.1% | |
| 45.9023 | 1 | < 0.1% | |
| 45.7252 | 1 | < 0.1% | |
| 45.7218 | 1 | < 0.1% | |
| 45.66 | 1 | < 0.1% | |
| 45.5965 | 1 | < 0.1% | |
| 45.5846 | 1 | < 0.1% | |
| 45.5318 | 1 | < 0.1% |
tumor_size
Real number (ℝ≥0)
| Distinct count | 6511 |
|---|---|
| Unique (%) | 71.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.723348239667614 |
|---|---|
| Minimum | 0.0 |
| Maximum | 20.999 |
| Zeros | 56 |
| Zeros (%) | 0.6% |
| Memory size | 71.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.2325 |
| Q1 | 2.32 |
| median | 5.0605 |
| Q3 | 13.336 |
| 95-th percentile | 18.61375 |
| Maximum | 20.999 |
| Range | 20.999 |
| Interquartile range (IQR) | 11.016 |
Descriptive statistics
| Standard deviation | 6.086852455 |
|---|---|
| Coefficient of variation (CV) | 0.7881105792 |
| Kurtosis | -1.123046703 |
| Mean | 7.72334824 |
| Median Absolute Deviation (MAD) | 3.474 |
| Skewness | 0.5772015728 |
| Sum | 70637.743 |
| Variance | 37.0497728 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 56 | 0.6% | |
| 2.006 | 8 | 0.1% | |
| 1.896 | 7 | 0.1% | |
| 2.243 | 7 | 0.1% | |
| 1.529 | 7 | 0.1% | |
| 2.012 | 7 | 0.1% | |
| 2.85 | 7 | 0.1% | |
| 1.645 | 6 | 0.1% | |
| 1.611 | 6 | 0.1% | |
| 2.891 | 6 | 0.1% | |
| 1.849 | 6 | 0.1% | |
| 2.227 | 6 | 0.1% | |
| 1.82 | 6 | 0.1% | |
| 1.73 | 6 | 0.1% | |
| 15.852 | 6 | 0.1% | |
| 2.573 | 5 | 0.1% | |
| 2.74 | 5 | 0.1% | |
| 1.755 | 5 | 0.1% | |
| 1.898 | 5 | 0.1% | |
| 1.651 | 5 | 0.1% | |
| 2.112 | 5 | 0.1% | |
| 1.856 | 5 | 0.1% | |
| 1.756 | 5 | 0.1% | |
| 1.781 | 5 | 0.1% | |
| 2.085 | 5 | 0.1% | |
| Other values (6486) | 8949 | 97.8% |
| Value | Count | Frequency (%) | |
| 0 | 56 | 0.6% | |
| 0.401 | 1 | < 0.1% | |
| 0.429 | 1 | < 0.1% | |
| 0.431 | 3 | < 0.1% | |
| 0.437 | 1 | < 0.1% | |
| 0.453 | 2 | < 0.1% | |
| 0.455 | 1 | < 0.1% | |
| 0.472 | 1 | < 0.1% | |
| 0.481 | 1 | < 0.1% | |
| 0.484 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 20.999 | 1 | < 0.1% | |
| 20.985 | 1 | < 0.1% | |
| 20.983 | 1 | < 0.1% | |
| 20.97 | 1 | < 0.1% | |
| 20.959 | 1 | < 0.1% | |
| 20.954 | 1 | < 0.1% | |
| 20.948 | 1 | < 0.1% | |
| 20.944 | 1 | < 0.1% | |
| 20.941 | 1 | < 0.1% | |
| 20.939 | 1 | < 0.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
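The calculation described above can be sketched in plain Python. This is a minimal illustration rather than the code the report uses; the function name `pearson_r` and the sample values are assumptions made for the example.

```python
import math


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length with at least 2 values")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/(n - 1) factors of the covariance and the standard deviations cancel,
    # so plain sums of products and squares are enough.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


xs = [1.0, 2.0, 3.0, 4.0, 5.0]
rising = [2.0 * x + 1.0 for x in xs]     # perfectly linear, positive slope
falling = [-3.0 * x + 10.0 for x in xs]  # perfectly linear, negative slope

print(pearson_r(xs, rising))
print(pearson_r(xs, falling))
```

The two linear series have different slopes and offsets, yet the coefficient only reflects the direction of the relationship, which illustrates the invariance under changes in location and scale.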
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
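As a rough sketch of that recipe, the code below ranks both variables and then applies the Pearson formula to the ranks. It assumes tied values share the average of their ranks, a common convention that the text above does not specify; all function names and sample values are illustrative.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson_r(xs, ys):
    """Covariance divided by the product of the standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def spearman_rho(xs, ys):
    """Pearson correlation applied to the rank variables of xs and ys."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


xs = [1.0, 2.0, 3.0, 4.0, 5.0]
cubes = [x ** 3 for x in xs]  # monotonic but not linear

print(spearman_rho(xs, cubes))
print(pearson_r(xs, cubes))
```

For the cubic series, ρ reaches its maximum because the ordering is preserved exactly, while Pearson's r stays below 1 because the relationship is not linear.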
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
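The pair-counting formula stated above can be written out directly. This sketch implements that formula as worded (often called tau-a); library implementations may additionally correct for ties, so results can differ on data with repeated values. The function name `kendall_tau` and the sample values are illustrative.

```python
from itertools import combinations


def kendall_tau(xs, ys):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length with at least 2 values")
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            concordant += 1   # both variables order this pair the same way
        elif product < 0:
            discordant += 1   # the variables order this pair in opposite ways
        # product == 0 means a tie; it counts as neither in this simple form
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# One swapped neighbour pair out of six possible pairs: (5 - 1) / 6.
tau = kendall_tau([1, 2, 3, 4], [1, 3, 2, 4])
print(tau)
```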
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available for the Phik library.
First rows
| | mass_npea | size_npear | malign_ratio | damage_size | exposed_area | std_dev_malign | err_malign | malign_penalty | damage_ratio | tumor_size |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 6930.90 | 2919.02 | 0.42116 | 51.8298 | 9.888294e+05 | 109.4870 | 2758.76 | 72 | 39.3620 | 14.103 |
| 1 | 15635.70 | 4879.36 | 0.31206 | 223.5500 | 2.058426e+06 | 248.8810 | 5952.53 | 240 | 22.0253 | 2.648 |
| 2 | 10376.20 | 2613.88 | 0.25191 | 127.3370 | 1.434676e+06 | 160.0930 | 4635.26 | 73 | 29.9963 | 1.688 |
| 3 | 13093.80 | 4510.06 | 0.34444 | 155.4400 | 1.812195e+06 | 173.0150 | 5273.87 | 32 | 28.1354 | 3.796 |
| 4 | 7545.21 | 2882.36 | 0.38201 | 85.1237 | 1.043918e+06 | 124.4140 | 3263.35 | 57 | 35.0200 | 18.023 |
| 5 | 6851.09 | 2195.18 | 0.32041 | 72.8283 | 9.484467e+05 | 97.1881 | 3688.57 | 40 | 36.3481 | 1.709 |
| 6 | 7069.24 | 1886.09 | 0.26680 | 58.2686 | 1.024783e+06 | 76.9447 | 3168.44 | 22 | 39.9961 | 9.937 |
| 7 | 16446.10 | 5115.45 | 0.31104 | 204.9740 | 2.167355e+06 | 265.8810 | 6425.91 | 242 | 22.9533 | 2.510 |
| 8 | 6814.73 | 2043.78 | 0.29990 | 90.0889 | 9.669835e+05 | 99.6286 | 3428.54 | 27 | 37.1642 | 12.568 |
| 9 | 5049.75 | 949.82 | 0.18809 | 41.2957 | 6.906229e+05 | 70.4142 | 2734.59 | 27 | 41.1366 | 13.428 |
Last rows
| | mass_npea | size_npear | malign_ratio | damage_size | exposed_area | std_dev_malign | err_malign | malign_penalty | damage_ratio | tumor_size |
|---|---|---|---|---|---|---|---|---|---|---|
| 9136 | 10704.20 | 3134.65 | 0.29284 | 99.6031 | 1.455340e+06 | 135.7840 | 5941.13 | 37 | 30.1301 | 15.570 |
| 9137 | 8584.66 | 2601.47 | 0.30303 | 96.4928 | 1.215774e+06 | 112.3210 | 3605.01 | 94 | 36.7163 | 2.123 |
| 9138 | 20392.00 | 7400.07 | 0.36289 | 185.2990 | 2.783620e+06 | 277.5200 | 4502.82 | 127 | 29.6816 | 8.639 |
| 9139 | 10363.60 | 2083.95 | 0.20108 | 87.8054 | 1.447825e+06 | 122.8710 | 3988.78 | 156 | 31.8885 | 4.139 |
| 9140 | 11679.40 | 2603.31 | 0.22289 | 154.5190 | 1.650030e+06 | 169.7800 | 4607.99 | 55 | 29.9713 | 0.915 |
| 9141 | 7250.25 | 3120.63 | 0.43041 | 82.0410 | 9.794768e+05 | 118.7710 | 3370.24 | 53 | 37.0260 | 13.127 |
| 9142 | 10145.00 | 3544.90 | 0.34942 | 90.1403 | 1.374393e+06 | 154.0270 | 5025.50 | 30 | 31.0565 | 17.091 |
| 9143 | 8086.10 | 1621.65 | 0.20054 | 78.5118 | 1.134257e+06 | 104.2840 | 3804.98 | 13 | 34.2739 | 1.971 |
| 9144 | 14418.90 | 6373.71 | 0.44203 | 84.0665 | 1.955398e+06 | 246.4450 | 19881.74 | 39 | 34.5885 | 17.749 |
| 9145 | 6852.61 | 1584.64 | 0.23124 | 51.3211 | 9.559976e+05 | 80.6543 | 3073.51 | 28 | 37.8939 | 14.103 |
Most frequent
| | mass_npea | size_npear | malign_ratio | damage_size | exposed_area | std_dev_malign | err_malign | malign_penalty | damage_ratio | tumor_size | count |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 4000.26 | 1087.13 | 0.27176 | 33.7320 | 5.694944e+05 | 46.0390 | 1773.46 | 12 | 44.4892 | 1.730 | 4 |
| 67 | 12958.40 | 3493.28 | 0.26957 | 162.5940 | 1.798705e+06 | 211.9350 | 4827.72 | 50 | 28.4308 | 2.740 | 4 |
| 12 | 5811.82 | 1053.23 | 0.18122 | 52.5591 | 7.999772e+05 | 65.2332 | 3034.98 | 24 | 39.7659 | 4.070 | 3 |
| 19 | 6411.31 | 2061.50 | 0.32154 | 44.7148 | 8.318863e+05 | 90.1626 | 3011.50 | 55 | 38.9741 | 3.429 | 3 |
| 23 | 7034.32 | 2238.05 | 0.31816 | 76.4392 | 9.631350e+05 | 111.3010 | 3069.56 | 82 | 39.1113 | 1.539 | 3 |
| 69 | 13475.40 | 4814.93 | 0.35731 | 168.5500 | 1.877844e+06 | 227.6050 | 4644.75 | 99 | 29.7563 | 7.190 | 3 |
| 0 | 3944.70 | 1086.38 | 0.27540 | 34.5915 | 5.580284e+05 | 45.4132 | 1780.16 | 12 | 41.1904 | 2.006 | 2 |
| 2 | 4025.81 | 1206.57 | 0.29970 | 32.7381 | 5.553452e+05 | 51.3178 | 1399.10 | 19 | 44.3150 | 1.447 | 2 |
| 3 | 4045.88 | 1385.75 | 0.34250 | 35.1162 | 5.581583e+05 | 50.8689 | 1388.97 | 17 | 44.2919 | 1.904 | 2 |
| 4 | 4914.20 | 1260.66 | 0.25653 | 38.8667 | 6.855367e+05 | 58.6129 | 1894.31 | 29 | 43.3704 | 12.135 | 2 |